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> A° ^ R X«^ U ^ 79- F © £jg t o T^HS V>«W*&«& # £T § fltfcfgE 
U^mmfc VfaW:. B 69cD^ >/1^ KX«^ U ^X°^ H ^ n - FT 

mzSizmmm^m^m^m^rc^y^^mx\^v^y o 9^(Dmn^m^ 
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;i/X (Bacillus) Slifflm\ ^ DX h U A (Clostridium) jglfflW, &V>«BH& 
m&mfZtl, (Bacillus) M^a^f^b^o Mfc* ^VA 

*&W<D®t3i®&m \,*T£.MT2> IW^ >;^KXU# U H i ITU 

M^.«Mffl, mm&m. itm^m. m&mm, mmmmm. mmtk&mmmt 

siaHff^o xm^ £ n, na^^ ©flgjt s jus^i* *> ^ 
sfcfe, mmx\z7fffi&fc~?z>mfc^£isxu, m^mmmo^m^^m^z^ 

\z, m^mj&mmxnmim, mmizum^m^mmumizm^r^mm^ 
s^si^c mfr&Mfc^&u, uM?>rt#m<D*km\z\-mmM^x&t> 

5S9§# £<fc -d T1H1$ nfeo 

The Bacillus subtilis genes referred to here follow the genome data reported in Nature, 390, 249-256 (1997) and published on the Internet by JAFAN: Japan Functional Analysis Network for Bacillus subtilis (BSORF DB) (http://bacillus.genome.ad.jp/).



Table 1

Gene               Position (kb)   Function
sigE               1604
sigF               2443
spoIISB            1328            (illegible)
spoIIE             71              (illegible)
sigG               1605            (illegible)
spoIVCB-spoIIIC    2652-2701





Table 2

Gene       Position (kb)   Function
spoIIGA    1604
spoIIAA    2444            (illegible)
spoIVFB    2855
spoIIR     3794            (illegible)
spoIIIJ    4213            (illegible)



JFiftftt^fififfctsns. The homology of amino acid sequences is calculated by the Lipman-Pearson method (Science, 227, 1435 (1985)).
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X ttaj? U ^ 7°^ F <D 3lM iz£> ^T, ^ cd £M£ © ft Ju&tJijjfc $ n o 

mm£T&mfc^*wmx\ttmmk~rz\z\z, m^mmmm^z^^m 

CR&2fc«fcD^itfcB»Ifi : fl5ffrtt'tffiODNAI|ffr*ifXtS*ogS* 

a <h © na t? e » ste^ (dm o © t@ n mM%- &Mz\z-&%z\t\z£ 

^CD^W&D (Mol. Gen. Genet., 223, 268 (1990) , £5Lfc7j?£c£ 
II D tMT £ £. \Z £ o T, <Dm£MfL&#l 
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tlSo Bmffi#mcD^fg^iJ^«, Bfe-ffca7C»* (Oxidoreductase) , 

#* (Transferase) , Jn*^0#*l (Hydrolase) , BMBm (Lyase) , UttfcSllt 
(IsomeraseK <&j£ll* (Ligase/Synthetase) ^$n5*«, jEFjgfcra-fe;^ 

MfoWlziZ, &mtaykftBWm<Dftm (Biochem. J., 280, 309 (199D) *X 
Lipman-Pearson?£ (Science, 227, 1435, (1985)) K: J:oTtf-^$n§ 0 
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_hlB 0ge^>/^lX(WJ^7°?F &^tf D N A Dt>^ £ 31 MS fc "7° =? 

NA^iJ^^-W^^haFlfSn-ms igFjtfS^ (768 bp) ^mi^TS^^ ^TO^(rfTX.«^f/i 0
splicing by overlap extension)-PCR method (Gene, 77, 61, (1989)) m\z£.
X)> s i gFlWH^n H>.fcD±aEfi!l©DNA»f>T*i:Jl*Jt3l«>J:r)TSK 

(D £ it tTT z\ <Hz<k~o~U, s i gFfc&~?<DW]mz&Z>n$l&mQo<?)M 

s i g FiAfe^- mMWr^ : 2442658^2443425) 0±Si:iit§ 1 . 5 k b 
Bftf* (A) , StTFWgt^l. 5kb»rlt (B) ^7X>FpC19 
4*^<hbTiiiiisb7t^n^A7x-n-;i/iif'|^mW^^o. 9 k bmPr 

(C) (A) (B) (C) ©Hfc^SfiUCSOE-PCRfefCtoTjUS-a-S 

i*\ 3. 9kb0DNA|rjt^ffe o ££>DNA»rtf-£JB^T* n>b°^>h?i 
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Wt(^VAi©s i gFJie^^tr** (2442632-2443318) 

>^t«fcOflligSnfeo -7t\ iifSilWUHKlbT, ^rVAJbOs i gEjffg^ ( 
1604166^1604885) S^trfii* (1604136-1604976) , spoIISBjffc^ ( 
1347913^1348083) ©^a^$^tf«« (1347781-1348081) > spoIIE ( 
70536^73019) jl^^^^^t?^ (70537-73018) , s i g G«fg^ ( 
1605025^1605807) (1605083-1605877) , spoIVCBM 

fg^ (2652262-^2652732) £^rtHM (2652156-2652723) , XfcJu spoIVC 
Bjate^Ss p oIIIC*e^*Tf<Z>fi« (2652262^2701023) S^tHB** ( 
2652156-2701031) jWtiJBfcSn, ^n7A7x-3-jI/iftlgf!:iilfci 

X XX t°— (Bacillus sp. ) KSM-S 2 3 7tS*©7M'Jt;V7^if 
ite^ OftW2000-210081#&«) ®fM* (3. lkb) ^>t hM7^-pH 
Y3 0 OPLKCDB amH I ffi!ll6&ft«7»rj& $ tlTdB&Z 7°7^5 FpH 

Y-S 2 3 7 $, 7°ah^7^h^:J;oTtAl/&o ^W:i;oT#snfel 

#c£l 0mL©LB^iT-^3 7 <CT|gMi&*£fT Mfc£©i£*^0. 0 

5mL^5 0mL©2XL-'?M Xi&Jfi (2%hU7°h>> 1 %H®X^X, 

UNaCK 7. 5 %^;H — X, 7. 5 p p mg|7>j| > 4 - 5 
15ppmfh7^7U» fc&«U 3 0tT3BFI, «^*£fTo7c 0 

come*, g3t^ifctt, ^^M^gfj^^fflVi^^^^-rnfe,

Gene               Position (kb)
sigE               1604            217
sigF               2443            212
spoIISB            1328            140
spoIIE             71              21b
sigG               1605            163
spoIVCB-spoIIIC    2652-2701       141
spoIVCB            2652            141
                                   100
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4. xtm^mifl, m^&j&mm^ mum. mm* x^vmizmmv, 

5. wmx^m^ih^n^mm^* *£^»©sigE , sigF , spoiie, 

spoIISBS^sigG©^-rn^> spoIVCB^S spoil K2ZT?<Dmm\Z'S2itlZ>XL&~? 

ttm&irzbcD-e&zmjm 6 x» 7 tBm©mmx.iiMo 
9 . m$m 6-8 ©v^rn^ 1 m&m<om^m^m >;^ix^ u ^ 






WO 02/097064 



PCT/JP02/05151 



SEQUENCE LISTING

<110> KAO CORPORATION

<120> Host microorganisms

<130> KS0660

<140>
<141>

<150> JP P2001-160520
<151> 2001-05-29

<160> 2

<170> PatentIn Ver. 2.1

<210> 1
<211> 3150
<212> DNA
<213> Bacillus sp. KSM-S237

<220>
<221> CDS
<222> (573)..(3044)

<220>
<221> sig_peptide
<222> (573)..(659)

<220>
<221> mat_peptide
<222> (660)..(3044)

<400> 1

gatttgccga tgcaacaggc ttatatttag aggaaatttc tttttaaatt gaatacggaa 60 

taaaatcagg taaacaggtc ctgattttat ttttttgagt tttttagaga actgaagatt 120 

gaaataaaag tagaagacaa aggacataag aaaattgcat tagttttaat tatagaaaac 180 

gcctttttat aattatttat acctagaacg aaaatactgt ttcgaaagcg gtttactata 240

aaaccttata ttccggctct tttttaaaac agggggtaaa aattcactct agtattctaa 300

tttcaacatg ctataataaa tttgtaagac gcaatatgca tctctttttt tacgatatat 360

gtaagcggtt aaccttgtgc tatatgccga tttaggaagg ggggtagatt gagtcaagta 420

gtaataatat agataactta taagttgttg agaagcagga gagcatctgg gttactcaca 480 

agttttttta aaactttaac gaaagcactt tcggtaatgc ttatgaattt agctatttga 540 

ttcaattact ttaaaaatat ttaggaggta at atg atg tta aga aag aaa aca 593 

Met Met Leu Arg Lys Lys Thr 
-25 

aag cag ttg att tct tcc att ctt att tta gtt tta ctt cta tct tta 641
Lys Gln Leu Ile Ser Ser Ile Leu Ile Leu Val Leu Leu Leu Ser Leu
-20 -15 -10

ttt ccg gca gct ctt gca gca gaa gga aac act cgt gaa gac aat ttt 689
Phe Pro Ala Ala Leu Ala Ala Glu Gly Asn Thr Arg Glu Asp Asn Phe
-5 -1 1 5 10

aaa cat tta tta ggt aat gac aat gtt aaa cgc cct tct gag gct ggc 737
Lys His Leu Leu Gly Asn Asp Asn Val Lys Arg Pro Ser Glu Ala Gly
15 20 25

gca tta caa tta caa gaa gtc gat gga caa atg aca tta gta gat caa 785
Ala Leu Gln Leu Gln Glu Val Asp Gly Gln Met Thr Leu Val Asp Gln
30 35 40

cat gga gaa aaa att caa tta cgt gga atg agt aca cac gga tta cag 833
His Gly Glu Lys Ile Gln Leu Arg Gly Met Ser Thr His Gly Leu Gln
45 50 55

tgg ttt cct gag atc ttg aat gat aac gca tac aaa gct ctt tct aac 881
Trp Phe Pro Glu Ile Leu Asn Asp Asn Ala Tyr Lys Ala Leu Ser Asn
60 65 70

gat tgg gat tcc aat atg att cgt ctt gct atg tat gta ggt gaa aat 929
Asp Trp Asp Ser Asn Met Ile Arg Leu Ala Met Tyr Val Gly Glu Asn
75 80 85 90

ggg tac gct aca aac cct gag tta atc aaa caa aga gtg att gat gga 977
Gly Tyr Ala Thr Asn Pro Glu Leu Ile Lys Gln Arg Val Ile Asp Gly
95 100 105

att gag tta gcg att gaa aat gac atg tat gtt att gtt gac tgg cat 1025
Ile Glu Leu Ala Ile Glu Asn Asp Met Tyr Val Ile Val Asp Trp His
110 115 120

gtt cat gcg cca ggt gat cct aga gat cct gtt tat gca ggt gct aaa 1073
Val His Ala Pro Gly Asp Pro Arg Asp Pro Val Tyr Ala Gly Ala Lys
125 130 135

gat ttc ttt aga gaa att gca gct tta tac cct aat aat cca cac att 1121
Asp Phe Phe Arg Glu Ile Ala Ala Leu Tyr Pro Asn Asn Pro His Ile
140 145 150

att tat gag tta gcg aat gag ccg agt agt aat aat aat ggt gga gca 1169
Ile Tyr Glu Leu Ala Asn Glu Pro Ser Ser Asn Asn Asn Gly Gly Ala
155 160 165 170

ggg att ccg aat aac gaa gaa ggt tgg aaa gcg gta aaa gaa tat gct 1217
Gly Ile Pro Asn Asn Glu Glu Gly Trp Lys Ala Val Lys Glu Tyr Ala
175 180 185

gat cca att gta gaa atg tta cgt aaa agc ggt aat gca gat gac aac 1265
Asp Pro Ile Val Glu Met Leu Arg Lys Ser Gly Asn Ala Asp Asp Asn
190 195 200

att atc att gtt ggt agt cca aac tgg agt cag cgt ccg gac tta gca 1313
Ile Ile Ile Val Gly Ser Pro Asn Trp Ser Gln Arg Pro Asp Leu Ala
205 210 215

gct gat aat cca att gat gat cac cat aca atg tat act gtt cac ttc 1361
Ala Asp Asn Pro Ile Asp Asp His His Thr Met Tyr Thr Val His Phe
220 225 230

tac act ggt tca cat gct gct tca act gaa agc tat ccg tct gaa act 1409
Tyr Thr Gly Ser His Ala Ala Ser Thr Glu Ser Tyr Pro Ser Glu Thr
235 240 245 250

cct aac tct gaa aga gga aac gta atg agt aac act cgt tat gcg tta 1457
Pro Asn Ser Glu Arg Gly Asn Val Met Ser Asn Thr Arg Tyr Ala Leu
255 260 265

gaa aac gga gta gcg gta ttt gca aca gag tgg gga acg agt caa gct 1505
Glu Asn Gly Val Ala Val Phe Ala Thr Glu Trp Gly Thr Ser Gln Ala
270 275 280

agt gga gac ggt ggt cct tac ttt gat gaa gca gat gta tgg att gaa 1553
Ser Gly Asp Gly Gly Pro Tyr Phe Asp Glu Ala Asp Val Trp Ile Glu
285 290 295

ttt tta aat gaa aac aac att agc tgg gct aac tgg tct tta acg aat 1601
Phe Leu Asn Glu Asn Asn Ile Ser Trp Ala Asn Trp Ser Leu Thr Asn
300 305 310

aaa aat gaa gta tct ggt gca ttt aca cca ttc gag tta ggt aag tct 1649
Lys Asn Glu Val Ser Gly Ala Phe Thr Pro Phe Glu Leu Gly Lys Ser
315 320 325 330

aac gca acc aat ctt gac cca ggt cca gat cat gtg tgg gca cca gaa 1697
Asn Ala Thr Asn Leu Asp Pro Gly Pro Asp His Val Trp Ala Pro Glu
335 340 345

gaa tta agt ctt tct gga gaa tat gta cgt gct cgt att aaa ggt gtg 1745
Glu Leu Ser Leu Ser Gly Glu Tyr Val Arg Ala Arg Ile Lys Gly Val
350 355 360

aac tat gag cca atc gac cgt aca aaa tac acg aaa gta ctt tgg gac 1793
Asn Tyr Glu Pro Ile Asp Arg Thr Lys Tyr Thr Lys Val Leu Trp Asp
365 370 375

ttt aat gat gga acg aag caa gga ttt gga gtg aat tcg gat tct cca 1841
Phe Asn Asp Gly Thr Lys Gln Gly Phe Gly Val Asn Ser Asp Ser Pro
380 385 390

aat aaa gaa ctt att gca gtt gat aat gaa aac aac act ttg aaa gtt 1889
Asn Lys Glu Leu Ile Ala Val Asp Asn Glu Asn Asn Thr Leu Lys Val
395 400 405 410

tcg gga tta gat gta agt aac gat gtt tca gat ggc aac ttc tgg gct 1937
Ser Gly Leu Asp Val Ser Asn Asp Val Ser Asp Gly Asn Phe Trp Ala
415 420 425

aat gct cgt ctt tct gcc aac ggt tgg gga aaa agt gtt gat att tta 1985
Asn Ala Arg Leu Ser Ala Asn Gly Trp Gly Lys Ser Val Asp Ile Leu
430 435 440

ggt gct gag aag ctt aca atg gat gtt att gtt gat gaa cca acg acg 2033
Gly Ala Glu Lys Leu Thr Met Asp Val Ile Val Asp Glu Pro Thr Thr
445 450 455

gta gct att gcg gcg att cca caa agt agt aaa agt gga tgg gca aat 2081
Val Ala Ile Ala Ala Ile Pro Gln Ser Ser Lys Ser Gly Trp Ala Asn
460 465 470

cca gag cgt gct gtt cga gtg aac gcg gaa gat ttt gtc cag caa acg 2129
Pro Glu Arg Ala Val Arg Val Asn Ala Glu Asp Phe Val Gln Gln Thr
475 480 485 490

gac ggt aag tat aaa gct gga tta aca att aca gga gaa gat gct cct 2177
Asp Gly Lys Tyr Lys Ala Gly Leu Thr Ile Thr Gly Glu Asp Ala Pro
495 500 505

aac cta aaa aat atc gct ttt cat gaa gaa gat aac aat atg aac aac 2225
Asn Leu Lys Asn Ile Ala Phe His Glu Glu Asp Asn Asn Met Asn Asn
510 515 520

atc att ctg ttc gtg gga act gat gca gct gac gtt att tac tta gat 2273
Ile Ile Leu Phe Val Gly Thr Asp Ala Ala Asp Val Ile Tyr Leu Asp
525 530 535

aac att aaa gta att gga aca gaa gtt gaa att cca gtt gtt cat gat 2321
Asn Ile Lys Val Ile Gly Thr Glu Val Glu Ile Pro Val Val His Asp
540 545 550

cca aaa gga gaa gct gtt ctt cct tct gtt ttt gaa gac ggt aca cgt 2369
Pro Lys Gly Glu Ala Val Leu Pro Ser Val Phe Glu Asp Gly Thr Arg
555 560 565 570

caa ggt tgg gac tgg gct gga gag tct ggt gtg aaa aca gct tta aca 2417
Gln Gly Trp Asp Trp Ala Gly Glu Ser Gly Val Lys Thr Ala Leu Thr
575 580 585

att gaa gaa gca aac ggt tct aac gcg tta tca tgg gaa ttt gga tat 2465
Ile Glu Glu Ala Asn Gly Ser Asn Ala Leu Ser Trp Glu Phe Gly Tyr
590 595 600

cca gaa gta aaa cct agt gat aac tgg gca aca gct cca cgt tta gat 2513
Pro Glu Val Lys Pro Ser Asp Asn Trp Ala Thr Ala Pro Arg Leu Asp
605 610 615

ttc tgg aaa tct gac ttg gtt cgc ggt gag aat gat tat gta gct ttt 2561
Phe Trp Lys Ser Asp Leu Val Arg Gly Glu Asn Asp Tyr Val Ala Phe
620 625 630

gat ttc tat cta gat cca gtt cgt gca aca gaa ggc gca atg aat atc 2609
Asp Phe Tyr Leu Asp Pro Val Arg Ala Thr Glu Gly Ala Met Asn Ile
635 640 645 650

aat tta gta ttc cag cca cct act aac ggg tat tgg gta caa gca cca 2657
Asn Leu Val Phe Gln Pro Pro Thr Asn Gly Tyr Trp Val Gln Ala Pro
655 660 665

aaa acg tat acg att aac ttt gat gaa tta gag gaa gcg aat caa gta 2705
Lys Thr Tyr Thr Ile Asn Phe Asp Glu Leu Glu Glu Ala Asn Gln Val
670 675 680

aat ggt tta tat cac tat gaa gtg aaa att aac gta aga gat att aca 2753
Asn Gly Leu Tyr His Tyr Glu Val Lys Ile Asn Val Arg Asp Ile Thr
685 690 695

aac att caa gat gac acg tta cta cgt aac atg atg atc att ttt gca 2801
Asn Ile Gln Asp Asp Thr Leu Leu Arg Asn Met Met Ile Ile Phe Ala
700 705 710

gat gta gaa agt gac ttt gca ggg aga gtc ttt gta gat aat gtt cgt 2849
Asp Val Glu Ser Asp Phe Ala Gly Arg Val Phe Val Asp Asn Val Arg
715 720 725 730

ttt gag ggg gct gct act act gag ccg gtt gaa cca gag cca gtt gat 2897
Phe Glu Gly Ala Ala Thr Thr Glu Pro Val Glu Pro Glu Pro Val Asp
735 740 745

cct ggc gaa gag acg cca cct gtc gat gag aag gaa gcg aaa aaa gaa 2945
Pro Gly Glu Glu Thr Pro Pro Val Asp Glu Lys Glu Ala Lys Lys Glu
750 755 760

caa aaa gaa gca gag aaa gaa gag aaa gaa gca gta aaa gaa gaa aag 2993
Gln Lys Glu Ala Glu Lys Glu Glu Lys Glu Ala Val Lys Glu Glu Lys
765 770 775

aaa gaa gct aaa gaa gaa aag aaa gca gtc aaa aat gag gct aag aaa 3041
Lys Glu Ala Lys Glu Glu Lys Lys Ala Val Lys Asn Glu Ala Lys Lys
780 785 790

aaa taatctatta aactagttat agggttatct aaaggtctga tgtagatctt 3094
Lys
795

ttagataacc tttttcttgc ataactggac acagagttgt tattaaagaa agtaag 3150
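The feature ranges quoted for SEQ ID NO: 1 (CDS 573..3044, sig_peptide 573..659, mat_peptide 660..3044) can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original listing: the names `SEQ1_FEATURES`, `feature_length`, `codon_count`, and `summarize` are hypothetical helpers introduced only for illustration, and the ranges are assumed to be 1-based and inclusive, as is usual in sequence listings.

```python
"""Cross-check the feature ranges quoted for SEQ ID NO: 1 (illustrative sketch)."""

# Feature ranges copied from the <222> fields of SEQ ID NO: 1.
# Assumption: coordinates are 1-based and inclusive.
SEQ1_FEATURES = {
    "CDS": (573, 3044),
    "sig_peptide": (573, 659),
    "mat_peptide": (660, 3044),
}


def feature_length(start: int, end: int) -> int:
    """Return the length in nucleotides of an inclusive range."""
    return end - start + 1


def codon_count(start: int, end: int) -> int:
    """Return the number of whole codons in the range, or raise if it is not a multiple of 3."""
    length = feature_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"range {start}..{end} is {length} nt, not a whole number of codons")
    return length // 3


def summarize(features):
    """Map each feature name to its codon count."""
    return {name: codon_count(start, end) for name, (start, end) in features.items()}


if __name__ == "__main__":
    for name, codons in summarize(SEQ1_FEATURES).items():
        print(f"{name}: {codons} codons")
```

Under these assumptions the signal peptide spans 29 codons and the mature peptide 795, which agrees with the residue numbering in the listing (-29 to -1, then 1 to 795) and sums to the 824 codons of the CDS.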



<210> 2
<211> 3332
<212> DNA
<213> Bacillus sp. KSM-64

<220>
<221> CDS
<222> (610)..(3075)

<220>
<221> sig_peptide
<222> (610)..(696)

<220>
<221> mat_peptide
<222> (697)..(3075)

<400> 2

agtacttacc attttagagt caaaagatag aagccaagca ggatttgccg atgcaaccgg 60 

cttatattta gagggaattt ctttttaaat tgaatacgga ataaaatcag gtaaacaggt 120 

cctgatttta tttttttgaa tttttttgag aactaaagat tgaaatagaa gtagaagaca 180 

acggacataa gaaaattgta ttagttttaa ttatagaaaa cgcttttcta taattattta 240 

tacctagaac gaaaatactg tttcgaaagc ggtttactat aaaaccttat attccggctc 300 

tttttttaaa cagggggtga aaattcactc tagtattcta atttcaacat gctataataa 360 

atttgtaaga cgcaatatac atcttttttt tatgatattt gtaagcggtt aaccttgtgc 420 

tatatgccga tttaggaagg gggtagattg agtcaagtag tcataattta gataacttat 480 

aagttgttga gaagcaggag agaatctggg ttactcacaa gttttttaaa acattatcga 540 

aagcactttc ggttatgctt atgaatttag ctatttgatt caattacttt aataatttta 600 

ggaggtaat atg atg tta aga aag aaa aca aag cag ttg att tct tcc att 651
Met Met Leu Arg Lys Lys Thr Lys Gln Leu Ile Ser Ser Ile
-25 -20

ctt att tta gtt tta ctt cta tct tta ttt ccg aca gct ctt gca gca 699
Leu Ile Leu Val Leu Leu Leu Ser Leu Phe Pro Thr Ala Leu Ala Ala
-15 -10 -5 -1 1

gaa gga aac act cgt gaa gac aat ttt aaa cat tta tta ggt aat gac 747
Glu Gly Asn Thr Arg Glu Asp Asn Phe Lys His Leu Leu Gly Asn Asp
5 10 15

aat gtt aaa cgc cct tct gag gct ggc gca tta caa tta caa gaa gtc 795
Asn Val Lys Arg Pro Ser Glu Ala Gly Ala Leu Gln Leu Gln Glu Val
20 25 30

gat gga caa atg aca tta gta gat caa cat gga gaa aaa att caa tta 843
Asp Gly Gln Met Thr Leu Val Asp Gln His Gly Glu Lys Ile Gln Leu
35 40 45

cgt gga atg agt aca cac gga tta caa tgg ttt cct gag atc ttg aat 891
Arg Gly Met Ser Thr His Gly Leu Gln Trp Phe Pro Glu Ile Leu Asn
50 55 60 65

gat aac gca tac aaa gct ctt gct aac gat tgg gaa tca aat atg att 939
Asp Asn Ala Tyr Lys Ala Leu Ala Asn Asp Trp Glu Ser Asn Met Ile
70 75 80

cgt cta gct atg tat gtc ggt gaa aat ggc tat gct tca aat cca gag 987
Arg Leu Ala Met Tyr Val Gly Glu Asn Gly Tyr Ala Ser Asn Pro Glu
85 90 95

tta att aaa agc aga gtc att aaa gga ata gat ctt gct att gaa aat 1035
Leu Ile Lys Ser Arg Val Ile Lys Gly Ile Asp Leu Ala Ile Glu Asn
100 105 110

gac atg tat gtc atc gtt gat tgg cat gta cat gca cct ggt gat cct 1083
Asp Met Tyr Val Ile Val Asp Trp His Val His Ala Pro Gly Asp Pro
115 120 125

aga gat ccc gtt tac gct gga gca gaa gat ttc ttt aga gat att gca 1131
Arg Asp Pro Val Tyr Ala Gly Ala Glu Asp Phe Phe Arg Asp Ile Ala
130 135 140 145

gca tta tat cct aac aat cca cac att att tat gag tta gcg aat gag 1179
Ala Leu Tyr Pro Asn Asn Pro His Ile Ile Tyr Glu Leu Ala Asn Glu
150 155 160

cca agt agt aac aat aat ggt gga gct ggg att cca aat aat gaa gaa 1227
Pro Ser Ser Asn Asn Asn Gly Gly Ala Gly Ile Pro Asn Asn Glu Glu
165 170 175

ggt tgg aat gcg gta aaa gaa tac gct gat cca att gta gaa atg tta 1275
Gly Trp Asn Ala Val Lys Glu Tyr Ala Asp Pro Ile Val Glu Met Leu
180 185 190

cgt gat agc ggg aac gca gat gac aat att atc att gtg ggt agt cca 1323
Arg Asp Ser Gly Asn Ala Asp Asp Asn Ile Ile Ile Val Gly Ser Pro
195 200 205

aac tgg agt cag cgt cct gac tta gca gct gat aat cca att gat gat 1371
Asn Trp Ser Gln Arg Pro Asp Leu Ala Ala Asp Asn Pro Ile Asp Asp
210 215 220 225

cac cat aca atg tat act gtt cac ttc tac act ggt tca cat gct gct 1419
His His Thr Met Tyr Thr Val His Phe Tyr Thr Gly Ser His Ala Ala
230 235 240

tca act gaa agc tat ccg cct gaa act cct aac tct gaa aga gga aac 1467
Ser Thr Glu Ser Tyr Pro Pro Glu Thr Pro Asn Ser Glu Arg Gly Asn
245 250 255

gta atg agt aac act cgt tat gcg tta gaa aac gga gta gca gta ttt 1515
Val Met Ser Asn Thr Arg Tyr Ala Leu Glu Asn Gly Val Ala Val Phe
260 265 270

gca aca gag tgg gga act agc caa gca aat gga gat ggt ggt cct tac 1563
Ala Thr Glu Trp Gly Thr Ser Gln Ala Asn Gly Asp Gly Gly Pro Tyr
275 280 285

ttt gat gaa gca gat gta tgg att gag ttt tta aat gaa aac aac att 1611
Phe Asp Glu Ala Asp Val Trp Ile Glu Phe Leu Asn Glu Asn Asn Ile
290 295 300 305

agc tgg gct aac tgg tct tta acg aat aaa aat gaa gta tct ggt gca 1659
Ser Trp Ala Asn Trp Ser Leu Thr Asn Lys Asn Glu Val Ser Gly Ala
310 315 320

ttt aca cca ttc gag tta ggt aag tct aac gca aca agt ctt gac cca 1707
Phe Thr Pro Phe Glu Leu Gly Lys Ser Asn Ala Thr Ser Leu Asp Pro
325 330 335

ggg cca gac caa gta tgg gta cca gaa gag tta agt ctt tct gga gaa 1755
Gly Pro Asp Gln Val Trp Val Pro Glu Glu Leu Ser Leu Ser Gly Glu
340 345 350

tat gta cgt gct cgt att aaa ggt gtg aac tat gag cca atc gac cgt 1803
Tyr Val Arg Ala Arg Ile Lys Gly Val Asn Tyr Glu Pro Ile Asp Arg
355 360 365

aca aaa tac acg aaa gta ctt tgg gac ttt aat gat gga acg aag caa 1851
Thr Lys Tyr Thr Lys Val Leu Trp Asp Phe Asn Asp Gly Thr Lys Gln
370 375 380 385

gga ttt gga gtg aat gga gat tct cca gtt gaa gat gta gtt att gag 1899
Gly Phe Gly Val Asn Gly Asp Ser Pro Val Glu Asp Val Val Ile Glu
390 395 400

aat gaa gcg ggc gct tta aaa ctt tca gga tta gat gca agt aat gat 1947
Asn Glu Ala Gly Ala Leu Lys Leu Ser Gly Leu Asp Ala Ser Asn Asp
405 410 415

gtt tct gaa ggt aat tac tgg gct aat gct cgt ctt tct gcc gac ggt 1995
Val Ser Glu Gly Asn Tyr Trp Ala Asn Ala Arg Leu Ser Ala Asp Gly
420 425 430

tgg gga aaa agt gtt gat att tta ggt gct gaa aaa ctt act atg gat 2043
Trp Gly Lys Ser Val Asp Ile Leu Gly Ala Glu Lys Leu Thr Met Asp
435 440 445

gtg att gtt gat gag ccg acc acg gta tca att gct gca att cca caa 2091
Val Ile Val Asp Glu Pro Thr Thr Val Ser Ile Ala Ala Ile Pro Gln
450 455 460 465

ggg cca tca gcc aat tgg gtt aat cca aat cgt gca att aag gtt gag 2139
Gly Pro Ser Ala Asn Trp Val Asn Pro Asn Arg Ala Ile Lys Val Glu
470 475 480

cca act aat ttc gta ccg tta gga gat aag ttt aaa gcg gaa tta act 2187
Pro Thr Asn Phe Val Pro Leu Gly Asp Lys Phe Lys Ala Glu Leu Thr
485 490 495

ata act tca gct gac tct cca tcg tta gaa gct att gcg atg cat gct 2235
Ile Thr Ser Ala Asp Ser Pro Ser Leu Glu Ala Ile Ala Met His Ala
500 505 510

gaa aat aac aac atc aac aac atc att ctt ttt gta gga act gaa ggt 2283
Glu Asn Asn Asn Ile Asn Asn Ile Ile Leu Phe Val Gly Thr Glu Gly
515 520 525

gct gat gtt atc tat tta gat aac att aaa gta att gga aca gaa gtt 2331
Ala Asp Val Ile Tyr Leu Asp Asn Ile Lys Val Ile Gly Thr Glu Val
530 535 540 545

gaa att cca gtt gtt cat gat cca aaa gga gaa gct gtt ctt cct tct 2379
Glu Ile Pro Val Val His Asp Pro Lys Gly Glu Ala Val Leu Pro Ser
550 555 560

gtt ttt gaa gac ggt aca cgt caa ggt tgg gac tgg gct gga gag tct 2427
Val Phe Glu Asp Gly Thr Arg Gln Gly Trp Asp Trp Ala Gly Glu Ser
565 570 575

ggt gtg aaa aca gct tta aca att gaa gaa gca aac ggt tct aac gcg 2475
Gly Val Lys Thr Ala Leu Thr Ile Glu Glu Ala Asn Gly Ser Asn Ala
580 585 590

tta tca tgg gaa ttt gga tac cca gaa gta aaa cct agt gat aac tgg 2523
Leu Ser Trp Glu Phe Gly Tyr Pro Glu Val Lys Pro Ser Asp Asn Trp
595 600 605

gca aca gct cca cgt tta gat ttc tgg aaa tct gac ttg gtt cgc ggt 2571
Ala Thr Ala Pro Arg Leu Asp Phe Trp Lys Ser Asp Leu Val Arg Gly
610 615 620 625

gaa aat gat tat gta act ttt gat ttc tat cta gat cca gtt cgt gca 2619
Glu Asn Asp Tyr Val Thr Phe Asp Phe Tyr Leu Asp Pro Val Arg Ala
630 635 640

aca gaa ggc gca atg aat atc aat tta gta ttc cag cca cct act aac 2667
Thr Glu Gly Ala Met Asn Ile Asn Leu Val Phe Gln Pro Pro Thr Asn
645 650 655

ggg tat tgg gta caa gca cca aaa acg tat acg att aac ttt gat gaa 2715
Gly Tyr Trp Val Gln Ala Pro Lys Thr Tyr Thr Ile Asn Phe Asp Glu
660 665 670

tta gag gaa gcg aat caa gta aat ggt tta tat cac tat gaa gtg aaa 2763
Leu Glu Glu Ala Asn Gln Val Asn Gly Leu Tyr His Tyr Glu Val Lys
675 680 685

att aac gta aga gat att aca aac att caa gat gac acg tta cta cgt 2811
Ile Asn Val Arg Asp Ile Thr Asn Ile Gln Asp Asp Thr Leu Leu Arg
690 695 700 705

aac atg atg atc att ttt gca gat gta gaa agt gac ttt gca ggg aga 2859
Asn Met Met Ile Ile Phe Ala Asp Val Glu Ser Asp Phe Ala Gly Arg
710 715 720

gtc ttt gta gat aat gtt cgt ttt gag ggg gct gct act act gag ccg 2907
Val Phe Val Asp Asn Val Arg Phe Glu Gly Ala Ala Thr Thr Glu Pro
725 730 735

gtt gaa cca gag cca gtt gat cct ggc gaa gag acg ccg cct gtc gat 2955
Val Glu Pro Glu Pro Val Asp Pro Gly Glu Glu Thr Pro Pro Val Asp
740 745 750

gag aag gaa gcg aaa aaa gaa caa aaa gaa gca gag aaa gaa gag aaa 3003
Glu Lys Glu Ala Lys Lys Glu Gln Lys Glu Ala Glu Lys Glu Glu Lys
755 760 765

gaa gca gta aaa gaa gaa aag aaa gaa gct aaa gaa gaa aag aaa gca 3051
Glu Ala Val Lys Glu Glu Lys Lys Glu Ala Lys Glu Glu Lys Lys Ala
770 775 780 785

atc aaa aat gag gct acg aaa aaa taatctaata aactagttat agggttatct 3105
Ile Lys Asn Glu Ala Thr Lys Lys
790

aaaggtctga tgcagatctt ttagataacc tttttttgca taactggaca tagaatggtt 3165

attaaagaaa gcaaggtgtt tatacgatat taaaaaggta gcgattttaa attgaaacct 3225

ttaataatgt cttgtgatag aatgatgaag taatttaaga gggggaaacg aagtgaaaac 3285

ggaaatttct agtagaagaa aaacagacca agaaatactg caagctt 3332
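The pairing of codon lines and amino-acid lines in the listing can be spot-checked by translating a row of codons with the standard genetic code. The sketch below is illustrative only and is not part of the original listing: `CODON_TABLE`, `THREE_LETTER`, and `translate` are hypothetical names, and the input is assumed to be lowercase or uppercase triplets separated by whitespace, as printed in the listing. The example row is nucleotides 700 to 747 of SEQ ID NO: 2.

```python
"""Translate space-separated codons into three-letter amino-acid codes (illustrative sketch)."""

# Standard genetic code, with codons enumerated in t, c, a, g order for each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter codes; "Ter" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(codons: str) -> str:
    """Translate whitespace-separated triplets; raise ValueError on anything that is not a codon."""
    residues = []
    for codon in codons.lower().split():
        if codon not in CODON_TABLE:
            raise ValueError(f"not a valid codon: {codon!r}")
        residues.append(THREE_LETTER[CODON_TABLE[codon]])
    return " ".join(residues)


if __name__ == "__main__":
    row = "gaa gga aac act cgt gaa gac aat ttt aaa cat tta tta ggt aat gac"
    print(translate(row))
```

Because any triplet containing a character other than a, c, g, or t raises an error, the same helper can flag transcription mistakes in a codon row before the translation is compared with the printed residues.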
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